Name | Version | Summary | date |
grag |
0.0.1b0 |
A simple package for implementing RAG |
2024-04-26 21:32:44 |
nncf |
2.10.0 |
Neural Networks Compression Framework |
2024-04-25 12:01:53 |
optimum-intel |
1.16.1 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2024-04-25 08:06:39 |
optimum |
1.19.0 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2024-04-16 13:44:57 |
vector-quantize-pytorch |
1.14.7 |
Vector Quantization - Pytorch |
2024-04-16 03:53:46 |
sparseml-nightly |
1.8.0.20240404 |
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models |
2024-04-04 13:52:43 |
sparsezoo-nightly |
1.8.0.20240401 |
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes |
2024-04-01 15:48:39 |
autoawq |
0.2.4 |
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. |
2024-03-24 11:45:19 |